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ABSTRACT 

Context. Westerlund I is the richest young cluster currently known in our Galaxy, making it one of the most massive clusters for 
which we can resolve the individual stars even in the crowded centre. This makes it an ideal target to assess whether massive clusters 
formed currently will remain bound or will disperse and contribute significantly to the stellar field population. 

Aims. Here we report a measurement of the radial velocity dispersion of Westerlund I to explore whether the cluster is currently in 
virial equilibrium, if it is in the process of collapse or if it is expanding and dispersing into the field. 

Mettiods. We obtained MIKE/Magellan high resolution optical spectra of 22 post main-sequence stars in Westerlund I for 2 or 3 
epochs with a maximum baseline of about one year. Radial velocities variations between these spectra have been measured through 
cross correlation. 

Results. We calculate the velocity dispersion from the cross correlation of five yellow hypergiants and one luminous blue variable, 
that show little radial velocity variations between epochs and have many spectral features in common. After taking into account the 
effect of small number statistics and undetected binaries, we estimate the velocity dispersion for the massive stars in Westerlund I to 
be 2.1+^ ] km s"'. For several different assumptions concerning possible mass segregation and the elongation of the cluster, we find 
that Westerlund I is subvirial at the 90% confidence level. 

Conclusions. We can rule out that the cluster is significantly supervirial at the 97% confidence level, indicating that Westerlund I is 
currently bound. This implies that Westerlund I has survived past the point where any gas expulsion has taken place and is expected 
to survive for billions of years. 
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1. Introduction 



Soon after its launch about twenty years ago, the Hubble Space 
Telescope identified many massive young star clusters in star- 
burst and merging galaxies ( Holtzman et al.|1992| l. These clus- 
ters are massive enough that they appear unlikely to disperse 
in a Hubble time due to tidal stripping ( |Lamers et al.|[2005] l, 
which means they could possibly become future globular clus- 
ters. However, violent internal or external events could cause 
the cluster to become unbound, in which case the cluster will 
dissolve into the field, significantly contributing to the stellar 
populations of their host galaxy. A likely candidate for such an 
event would be the expulsion by stellar winds or the first super- 
novae of the remainder of the gas cloud out of which the cluster 
formed. This scenario has been extensively studied both analyt- 
ically (pills 1980; Elmegre en 1983, M athieu|I983|l and with N - 
body simulations ( |Lada et a l. 1984; Goodwin & Bastian|2006| . 
These studies find that the expulsion of gas lowers the potential 
well in which the stars reside, causing a previously virial clus- 
ter to expand. The efficacy of the gas expulsion to disrupt clus- 
ters depends on several variables, including the star formation 
efficiency, the timescale of the expulsion (Hills 1980; Mathieuj 
I983[ l and whether the stars have achieved virial equilibrium 
with the potential of the surrounding gas before the gas is ejected 
( |Goodwin|2009| l . Without studying the dynamical state of these 
clusters, we do not know which fraction of them would survive 
to become future globular clusters analogues and which ones 
will disperse into the field. 



The only information on the dynamic state of these distant 
massive clusters, that we can currently obtain is the radial veloc- 
ity dispersion (e.g.lBastian et al. 2006 1. Under the assumption of 



virial equilibrium the one-dimensional velocity dispersion can 
be used to estimate the mass of the cluster, if its size is resolved. 
This mass estimate is called the dynamical mass. The dynami- 
cal mass can be compared to the mass derived from the lumi- 
nosity of the cluster, assuming a universal IMF. Many clusters 
have been found to have a significantly higher velocity disper- 
sion, than expected from the luminosity (Bast ian et al.|2006) l. 
Several explanations have been proposed for this, including: (i) 
the IMF might actually be different under extreme star form- 
ing conditions (Mengel et al. 2002), (ii) the velocity dispersion 
might be significantly increased due to binaries ( )Kouwenhovenl 
|& de Grijs|2008[|Gieles et al.|20I0[ ), or (iii) the cluster might be 
supervirial, for example due to a recent expulsion of gas from the 
cluster (Goodwin & Bastian 2006; Baumgardt & Kroupa 2007jl. 
These scenarios can be distinguished through observations of a 
cluster for which we can resolve the individual stars and thus 
measure the binary properties and the IMF. 

Westerlund I is the most massive young cluster currently 
known in our Galaxy (Portegies Zwart et al. 2010 ). Photometric 
observations with the NTT/SOFI in the JHKs bands ( |Brandner| 
et al.|2008{ Gennaro et al.,201 1, ) show an elongated cluster with 
a mass of 4.9'^^^ x 10"* Mq, a half mass radius rhm = 1.0 pc, and 
a distance of 4 + 0.2 kpc. This distance is confirmed by com- 
paring the radial velocity of the H II gas in Westerlund I with 
the rotation curve of the galaxy ( Kothes & Dougherty|2007 1. At 
this distance from the galactic centre stellar evolution and galac- 
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tic tidal stripping are expected to disperse Westerlund I over 



9 Gyr, if it is currently in virial equilibrium (Baumgardt & 
Makino|[2003j ). Thus Westerlund I might represent a precursor 



of a future low-mass, metal-rich "globular cluster". However, at 
an age of 4.5-5 Myr ( [Crowther et al.|2006[|Gennaro et al.|201 l| l 
we expect that Westerlund I might be out of virial equilibrium 
due to any recent expulsion of gas, which has not been con- 
verted into stars ( Goodwin & Bastian 2006). In this paper we 
describe our measurements of the radial velocity dispersion in 
Westerlund I to explore whether the cluster is likely to be bound 
or not. If Westerlund I is in virial equilibrium we expect a one- 
dimensional velocity dispersion of ~ 4.6 km s ' (see Sect. ^ 
for the mass and radius measured by .Brandner et al.,(2008j and 
|Gennaroet"aL|(|2011|l 



Mengel & Tacconi-Garman ( 2007 2009 1 previously mea 



sured the radial velocity dispersion of Westerlund I using K- 
band spectra. They found a velocity dispersion of 9.2 + 2.5 
km s ' based on a sample of ten stars. This value could be 



severely inflated due to the presence of binaries (Kouwenhoven 
& de Grijs||2008| [Gieles et al.|[20T0| . [Ritchie et al.| ( |2009a|b| l 



have also measured the radial velocities for a large sample 
of members and candidate members of Westerlund I using 
VLT/FLAMES+GIRAFFE spectra from 8350 to 9000 A over 
multiple epochs to check for variability and investigate the bi- 
nary properties of these massive stars. 

In this paper we present multi-epoch high-resolution spec- 
troscopy to find the velocity dispersion corrected for bina- 
ries of Westerlund I and thus constrain the dynamical state of 
Westerlund I. In Sect. |2] we present the observations and discuss 
the data reduction. We present the radial velocity variations be- 
tween epochs due to binaries or atmospheric instabilities in the 
target stars, and calculate the velocity dispersion using the stars 
with low variability in Sect. [3] The implications of the measured 
velocity dispersion on the dynamical state of Westerlund I are 
discussed in Sect. [4] Finally we present our conclusions in Sect. 



2. Observations & data reduction 

Observations were made using the MIKE spectrograph 
( Bernstein et al.|2003 i located on the Magellan Clay telescope 
at the Las Campanas Observatory in June 2009, August 2009 
and July 2010. MIKE has a red and a blue arm, providing two 
echelle spectra with a combined wavelength coverage from 3200 
to 9000 A. A slit of 0.7 arcseconds was used, cotTesponding 
to a resolution of 53.000 (~ 6 km s ' per resolution element). 
Due to the large interstellar reddening towards Westerlund I 
(Ay = 9.6 - 12.3 [Negueruela et al.||2010| l usable spectra are 
only obtained above 5000 A. We observed 22 of the brightest 
spectroscopically confirmed members (/ < 14.7 mag) selected 
from (Clark et al. (2005). These stars include two of the four red 
supergiants, all six yellow hypergiants, a sgB[e] star, a LBV in 
its cool phase and 12 OB-supergiants. All of these targets were 
observed for 2 or 3 epochs. 

The spectra were reduced with the MIKE Redux pipelin^ 
with some minor adjustments. All images were bias subtracted 
using the overscan regions. Flat field images were derived from 
internal quartz lamp frames. The slit was in place during these 
measurements to ensure the variation of wavelengths with the 
location on the CCD is similar as in the observations. To obtain 
the flat field in the dark area between the orders, a diffiising glass 



' The MIKE Redux pipeline was written by S. M. Buries, J. X. 
Prochaska and R. Bernstein in IDL l http://web.mit.edu/~burles/www/^ . 



is positioned in the optical path just downstream of the slit illu- 
minating the area between the orders. After flat fielding we still 
find spurious low frequency spatial variations in the illumination 
of the detector above ~ 8200 A for all spectra taken on August 
2009 and July 2010, including the internal quartz lamp spectra. 
These features are slightly weaker for the internal quartz lamp 
and are spread out by the diffusing glass, suggesting that these 
features are created in the optical path of the telescope. To cor- 
rect these variations we calculate a smoothed flat field, which 
no longer contains the high frequency pixel-to-pixel variations, 
but only these low frequency spatial features. Although these 
spurious features are weaker in the flat field than in the science 
images, we find that they can still be roughly corrected for by 
dividing the flat fielded image twice by this smoothed flat field. 

The curvature of the orders along the CCD is fitted using 
traces along the edge of the orders obtained from internal quartz 
lamp images. The scattered light was fitted from the dark areas 
by a B-spline with 10 knots between the orders and subtracted. 
The long slit of MIKE (5 arcseconds) allowed us to find the sky 
emission lines in the part of the order not illuminated by the star 
and subtract these emission lines. Regularly during the nights 
ThAr images were taken for wavelength calibration. Two ThAr 
lamp images were taken at different exposure times, to maxi- 
mize the range of lamp line strengths, which could be used in 
the wavelength calibration. Combined with the position of the 
orders on the CCD, these ThAr images were used to calculate a 
wavelength for every pixel on the image. The spectra are opti- 
mally extracted onto a common wavelength frame, allowing us 
to add the subsequent exposures of the same target. The noise 
was estimated at every pixel as the sum of the Poisson error in 
the observed flux and the readout noise. This calculated noise is 
consistent with the variations between multiple subsequent ob- 
servations of the same target and corresponds to a S/N of over 
100 in the red part of the spectra. The flux has been normalized 
using a cubic spline fit with two knots. 

To correct for possible off-center placement of the source star 
along the slit, we check for zero-point shifts of the wavelength 
solution in every observed spectrum. To this end we measure the 
shift of the telluric absorption lines in our stellar target spectra 
compared to a National Solar Observatory (NSO) telluric spec- 
trum convolved to the resolution of our spectra. We select sixteen 
wavelength ranges, where there are no or only very weak con- 
tamination from stellar spectral lines or diffuse interstellar bands 
and which contain strong telluric lines. Over these wavelength 
ranges we calculate the peak of the cross correlation between the 
NSO spectrum and the observed spectra. The shift of the zero- 
point of the wavelength for every spectrum is the average of the 
shifts calculated for the sixteen individual wavelength ranges. 
Figure [T] shows the difference between the individual measure- 
ments and the average offset. The distribution of these variations 
is fairly narrow (cr = 0.23 km s"'). These offsets are smaller than 
the precision we obtain in measuring the radial velocity varia- 
tions of our target stars, implying that we are not limited by the 
accuracy of the wavelength calibration. 

3. Results 

3.1. Radial velocity variations 

Radial velocity variations due to binary stars or activity in the 
stellar atmospheres can inflate our measurement of the veloc- 
ity dispersion of Westerlund I. To identify the variable stars, we 
search for radial velocity variations between epochs over our 
~1 year baseline. We calculate the velocity difference between 
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Table 1. The observed stars with the id of the star from |Westerlund| ( |1987| l , updated by |Clark et al.| ( |2005| l (Col 1), its spectral type 
( Negueruela et al. 2010 Clark et al.|2010[ l (Col 2), the deviation of velocities measured for a single epoch from the average velocity 
of that star (Col 3-5), and the maximum velocity difference between two epochs (Col 6). All velocities are in km s '. The relative 
velocities have been plotted in Figure |2] 
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See Ritchie et al. j2010f for a discussion on the spectroscopic binary W13. 
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Fig. 1. The offset of the velocity zero point measured for the in- 
dividual telluric lines from the averaged velocity zero point for 
sixteen wavelength regions containing strong telluric lines. On 
the right the histogram of all these offsets is plotted. The stan- 
dard deviation from this distribution is about 230 m/s. 



epochs on a line-by-line basis, by cross coiTelating the spectra 
around selected spectral lines. To determine the peak of the cross 
correlation on a sub-pixel level we then fit a Gaussian to the 21 
points closest to the peak. 

For every spectral line the procedure above will give us the 
spectral shift between any two epochs. We write this measured 
spectral shift between epoch / and epoch /' as AV,j. If we have 
more than two epochs, than the best estimate of the line shift be- 
tween two epochs is not only measured by the direct cross cor- 
relation between these two spectra (AV,y), but also through the 



difference of the velocities measured relative to any other epoch 
k (AVa - AVi^/). These different determinations of the spectral 
shifts will not be the same in general (i.e. AV^ AV,j; - AV^.y). 
Allende Prieto ( |2007| l showed that it is possible to combine the 
information of the spectral shift from the direct cross coiTelation 
and from the spectral shifts measured to any other epoch, to get 
a more accurate estimate of the actual spectral shift, called AV'.j 
(their equation 3). These estimates will obey AV.j = AV'-j^ - AV^^-, 
so they can be used to assign a unique velocity to every epoch, 
relative to some arbitrary zero-point. We will set this zero-point 
to the mean of the relative velocities. 



Using the method from [Allende Prieto ( 2007 1 we thus get 
an estimate of the relative velocities between the epochs for ev- 
ery spectral line. Figure |2] shows this relative velocity averaged 
over all spectral lines for every star, except the spectroscopic 
binary W13 (Ritchie et al. |2010] ). The error bars are given by 



the standard deviation of the mean, which will be an accurate 
representation of the uncertainty if it is dominated by random 
Gaussian errors such as expected if the noise in the spectra is the 
main source of uncertainty. These velocity shifts have also been 
listed in Table [T] together with the largest velocity difference 
measured between two epochs. Only three stars (W56, W238, 
and W9) are consistent with having no spectral line shifts at the 
2-sigma level. All four of these stars tend to have large uncer- 
tainties of a few km s '. For all the others the radial velocity 
variations between the epochs are much larger than the quoted 
error bars, which mean that the same spectral shift is found con- 
sistently along multiple spectral lines. Similar analysis of our 
radial velocity standards show a discrepancy between our esti- 
mates and the published values ( Udry et al. 1999 ) of about 0.4 
km s Although this is much smaller than the radial velocity 
changes we measure in our target stars, it is still slightly larger 
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Fig. 2. The radial velocity of an epoch relative to the mean of the velocities of all epochs that the star was observed in km s ' with 
1-sigma uncertainties. These relative velocities have also been listed in Table[T] The vertical dotted lines separate from left to right, 
the B-supergiants, the YHG hypergiants (and the LBV W243), the M supergiants, and the sgB[e] star W9. Only the stars W56, 
W238 and W9 are consistent with having no radial velocity valuations. 



than our error estimate, indicating that we might underestimate 
our errors. Note that radial velocity changes in our target stars 
do not have to be caused by a binary star. Instead they may be 
related to stellar activity, such as large-scale super-granulation 
or changes in the stellar winds. That atmospheric activity can 
lead to radial velocity variations of many km s"' for these mas- 
sive post-main sequence stars is well established in the literature 
(e.g. p'e Jager|ro8[|Lobel et aI]T998| ). 

That the cause of these spectral line shifts could be atmo- 
spheric as well as binaries is illustrated by the evolution of the 
spectrum of the M3 supergiant W237 shown in Figure [3] Over 
time the Ca II line at 8663 A is seen to evolve differently from 
most other lines. While the Ca II line is blueshifted in the last 
epoch (red line in Fig|3]l compared to the earlier observations, the 
other lines are clearly redshifted. These inconsistent line shifts 
are repeated throughout the spectrum and are the cause of the 
relatively large error bars for the relative velocity measurements 
of W237 compared to the other red supergiant W26 and the yel- 
low hypergiants in Figure |2] 

Another argument for the importance of stellar activity can 
be made for Wl 1, W23a and to a lesser extent W56. These stars 
show a significant shift of the spectral lines over a 1 day base- 
line in July 2010. Although this could be explained by a short- 
period binary, such short-period binaries are expected to have 
a velocity amplitude of 100 km/s. Such a large velocity shift is 
not found compared to the other epoch of these stars, observed 
on June 2010, which could be a coincidence or caused by either 
a large inclination or a low mass companion. It is unlikely that 
these scenarios can explain the observations for all stars, sug- 
gesting that the spectral line shifts are likely to be caused by at- 
mospheric variability. For the other stars, distinguishing between 
spectral line shifts, caused by binaries or stellar activity, is dif- 
ficult. However, given the high observed binary fraction among 
massive stars, we do expect a significant portion of the radial ve- 
locity variations to be caused by binary companions ( [Sana et aT] 

The stars in Table[T]and on the x-axis of Figure|2]have been 
roughly sorted by spectral type. The LBV is currently in a cool 
phase and has a lot of lines in common with the A/F hypergiants. 
Figure|2]shows two trends with spectral type. The first trend is a 
decrease in the error bars to later spectral types, which is caused 
by the far larger sample of spectral lines, most of which are less 
broad, which are available for the later type stars. The main ex- 
ception on this trend is W237, which has been discussed above 
(see Fig |3]l. The other trend is that the yellow hypergiants and 
the red supergiants tend to show lower radial velocity variations 
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Fig. 3. The spectra of the three epochs of Wdl 237 (M3 la) with 
blue corresponding to June 2009, green to August 2009 and red 
to July 2010. The Ca II Une at 8663 A is slightly blueshifted in 
July 2010 compared to the other epochs, while the other lines 
are clearly redshifted. 



than the B-supergiants. This variation stays much larger than the 
error bars, which means that this trend is consistently found over 
many different spectral lines. This could be explained by the 
fact that we are sensitive to relatively close binaries with peri- 
ods of at most a few years. [ETdridge et al.| ( |20d 8 ) showed that 
yellow hypergiants and red supergiants in close binaries evolve 
more quickly into a Wolf-Rayett star, because they lose their 
hydrogen atmosphere more effectively in binary-induced mass 
loss. So yellow hypergiants and red supergiants in close bina- 
ries spend less time in these extended phases before becoming 
a Wolf-Rayett star than if these stars were in wide binaries or 
in isolation ( jClark et al.||201 1| . This creates a selection effect, 
where observed yellow hypergiants or red supergiants tend to 
have a low fraction of close binaries. 

3.2. Velocity dispersion 

We calculate the velocity dispersion of the cluster using only 
the yellow hypergiants and the LBV. This sample was chosen 
because these stars show a small amount of radial velocity varia- 
tions compared to most other stars and they have a large number 
of spectral lines in common with each other. The large number of 
lines in common means that we can measure the velocity disper- 
sion on the basis of the cross correlation without having to deal 
with the systematics, which complicate the measurement of the 
absolute radial velocity. When trying to include more low vari- 
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Fig. 4. The velocities of all epochs of the YHG and the LBV 
relative to each other. The horizontal dashed lines show the mean 
velocity for every star. The zero-point of the y-axis has been 
arbitrarily set to the mean of all the velocities. 



ability stars in this sample, we find that the extra noise caused by 
having to place the radial velocities on an absolute scale offsets 
the advantage that we gain by having a larger sample size. The 
choice to base our measurement of the velocity dispersion on the 
cross coiTelation implies that we a direct comparison with pre- 
vious radial velocity measurements of the stars in Westerlund I 
is not possible ( Mengel & Tacconi-Garman|2007 2009 Ritchie 
let al.|2009a|b] ). 

Using the same method as in our search for radial veloc- 
ity variations, we use cross correlations to calculate the relative 
velocities between the different stars in our sample. The relative 
velocities for all the epochs of the yellow hypergiants is shown in 
Figure|4]with the mean relative velocity for every star marked as 
a dashed line. We see that the velocity variations between epochs 
of the same star is on the same order as the velocity differences 
between stars. This implies a very low intrinsic velocity disper- 
sion for Westerlund I. Based on the outlying average velocity 
and the large velocity change between epochs we identify W16a 
as a binary star (or as having a very active atmosphere) and ex- 
clude it from our calculation of the velocity dispersion. Below 
we calculate the velocity dispersion based on the other six stars. 

Due to the small number of stars in our sample we will have a 
large statistical uncertainty in the measured velocity dispersion. 
To calculate this statistical uncertainty we calculate the likeli- 
hood that the observed distribution would have been drawn from 
a cluster with a Gaussian radial velocity distribution with stan- 
dard deviation cr^ and mean //: 



exp 



(v,-A'r 



(1) 



where v, is the observed mean velocity of the star and cr, is the 
uncertainty on the measured velocity. The likelihood that the ob- 
sei-vations were drawn from a distribution with a given veloc- 
ity dispersion is calculated by integrating over With only 2-3 
epochs over a 1 year baseline and most, if not all, stars being 
variable the uncertainties on the measured radial velocities of 
the stars (o",) are not well determined. Because of this we will 
consider two cases. In the first case we set cr? to the variance of 
the velocity spread between epochs. This leads to the probability 
distribution shown as the blue dashed line in FigurelSland corre- 

1 7*1 



sponds to an intrinsic velocity dispersion of cr 



-1.7 



km s 



where the errors give the 95% confidence limits. Alternatively 
we set cr, to zero, leading to a velocity dispersion of cr = 2.1^^^'^. 
This case is shown as the red dotted line in Figure |5] Note that 




Fig. 5. Our estimate of the probability distribution of the ve- 
locity dispersion for the massive stars in Westerlund I. The blue 
dashed line shows the case where the measurement error is is 
set by the variation of the radial velocity between epochs. The 
red dotted line shows the limit, where we assume no measure- 
ment eiTors. The vertical lines mark the velocity dispersion ex- 
pected in virial equilibrium under several assumptions explained 
in Sect. [4] From the highest to lowest estimates we have the ve- 
locity dispersion calculated for a spherical, non-mass segregated 
cluster for the mass estimates from Negueruela et al. (2010 1 
(black, long dashed) and Gennaro et al., ( ,201111 (cyan, dotted). 
For the mass estimate of [Gennaro et al.| ( |2011[ ) we also show 
the expected velocity dispersion taking into account the velocity 
anisotropy, suggested by the elongation of Westerlund I (yellow, 
dashed), and the case where we take into account both the veloc- 
ity anisotropy and the possible mass segregation (green, dash- 
dot). The eiTor bars represent the quoted uncertainty in the pho- 
tometric mass of [Gennaro et al.| ( [201 l| l. 



the peak in the second case overlaps exactly with the standard 
deviation of the measured stellar velocities (2.15 km s"'). 

The second case, with cr,- - 0, can be seen as an "upper-limit 
estimate" of the probability distribution of true velocity disper- 
sion of Westerlund I. This velocity dispersion would be accurate 
if the errors were much smaller than 1 km s Given the large 
radial velocity variations measured between epochs, as well as 
the possibility of undetected binaries with a period of > 5 years 
and a velocity amplitude of tens of km s"', it is a very unlikely 
scenario that the uncertainties in our estimate of the systematic 
radial velocity are below 1 km s However, including more 
accurate measurement eiTors would only decrease the measured 
velocity dispersion, as illustrated by the case where we set the 
measurement eiTors to the variance of the velocity variations be- 
tween epochs (blue dashed line in Figure |5]l. Increasing cr, does 
not change the probability at the higher velocity dispersions sig- 
nificantly, i.e. the upper bound of the 95% confidence limit re- 
mains around 5 km s"'. Because the uncertainties are not well 
defined, we will conservatively focus on the case with cr, = 0. 



4. Discussion 

To explore the virial state of the cluster, we compare our mea- 
surement of the velocity dispersion with the velocity disper- 
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sion expected in virial equilibrium for the photometric mass of 
Westerlund I. We calculate a first estimate of the velocity dis- 
persion expected in virial equilibrium assuming Westerlund I 
is not mass segregated and that the velocities are isotropic. In 
virial equilibrium the photometric mass is equal to the dynami- 
cal mass, which under these assumptions is given by: 



yn 



(2) 



where 77 ^ 10 for the density profile found by Brandner et al. 
( 2008|l, (Trad is th e radial velocity dispersion, and ~ l.Qpc 
( Brandner et al.|[2008,) is the half mass radius. Based on an 
extrapolation from the number of massive stars in Westerlund 
I, Negueruela et al. ( |20I0 1 estimated a stellar mass of about 
10^ Mq. In virial equilibrium this coiTesponds to a velocity dis- 
persion of ~ 6.5 km s ^ This is far higher than our measured 
velocity dispersion, implying that Westerlund I is subvirial at 
the 98 % confidence level. An alternative photometric mass es- 
timate has been given by 'Gennaro et al. (2011 1. Based on an 
extrapolation of the measured IMF for the intermediate mass 
stars between 3.5 and 27 Mq, they found a lower stellar mass 
of 4.9^Q 5 X 10'*Mq. In virial equilibrium we would then expect 
a velocity dispersion of 4.6^^ 3 km s"'. For this velocity disper- 
sion our measurements still favor a subvirial cluster at the 91 % 
confidence level. 

In the conversions from the photometric mass to the veloc- 
ity dispersion calculated above we have assumed that the cluster 
has an isotropic velocity distribution and that it is not mass seg- 
regated. Both of these assumptions are unlikely to be valid for 
Westerlund I. An isotropic velocity distribution would generally 
lead to a spherical cluster. However, Westerlund I is observed to 
be elongated, with surfaces of constant density lying on ellipses 
rather than circles ( Gennaro et al.|20lT) . The exact implications 
of this elongation on the expected velocity dispersion depend 
on its origins. Under the assumption that this elongation is sup- 
ported by a velocity anisotropy and thus is not a transient on the 
dynamical timescale, we calculate in appendix |A] the expected 
velocity dispersion. We find that the dynamical mass is given by 
eq A. 7 reproduced here: 



|et al.|2003| [Negueruela et al.|2010| l. The second population con- 
tains the remaining low and intermediate mass stars in the clus- 
ter, with a half-mass radius of 1.0 pc and a much higher total 
mass of 4.3 x 10'* Mq ( [Brandner et al.|2 008 ). For a cluster whose 
stellar volume density is well-represented by a Plummer sphere 
we find that in virial equilibrium the velocity dispersion of the 
total population is about 10% larger than for the subset of mas- 
sive star using a two-fluid approximation (eq 7.158 in Binney & 
Tremaine 2008 ). This would lead us to expect to measure a ve- 
locity dispersion of roughly 4. 1 ^[j j km s" ' for a mass segregated, 
elongated cluster. 

Besides showing the probability distribution of the velocity 
dispersion of the massive stars in Westerlund I implied by our 
observations. Figure [5] also illustrates the velocity dispersions 
expected in virial equilibrium using the various assumptions dis- 
cussed above. Even for the lowest estimate our obsei-vations still 
strongly favor that Westerlund I is subvirial or virial. For the 
probability distribution including a rough estimate of our mea- 
surement errors we find a 91 % probability of the observed veloc- 
ity dispersion to lie below the lowest estimate of 4. 1 km s"' . For 
the more conservative probability distribution, which assumes 
no measurement errors, we still find a 87% probability. 

For the cluster to be unbound, the kinetic energy has to be 
higher than the binding energy, which would correspond to a ve- 
locity dispersion V2 higher than expected in virial equilibrium. 
This can be excluded with at least 97 % confidence from our 
measurements, with the exact percentage depending on the as- 
sumptions made. This implies that Westerlund I is bound and 
has survived any recent gas expulsion. To survive the gas expul- 
sion Westerlund I should either have had a high star-formation 
efficiency, which would cause the cluster to remain close to virial 
equilibrium or the stars in the cluster should have been dynami- 
cally cold at the moment the gas was expelled (Goodwin' 2009[ l. 
If the case of Westerlund I is typical for a massive cluster, this 
would mean that the excessive velocity dispersion measured for 
extragalactic clusters does not automatically imply a supervirial 
cluster, but has another source, for example binaries (Gieles et al. 
2010). In any case, Westerlund I itself will not disperse into the 
field quickly and will likely persist for billions of years. 



Md 



yn 



'/ellips'?- 



(3) 



where rf^^ is the half mass radius, if the density profile in all di- 
mensions was the same as along the semi-major axis, r^^ can be 



estimated from the half-mass radius of 1.0 pc (Brandner et al. 
20081 1 and the ellipticity of Westerlund I of 0.75 ( [Gennaro et al. 
2011 1 to be roughly 1.2 pc. The correction factor for the ellip- 



ticity of the cluster T/eiups is approximately 0.85, if the axis along 
the line of sight is not much smaller than the observed axes. The 
other parameters in the equation are the same as in eq [2l For the 



photometric mass estimate of 4.9;^,,? x 10 Mq (Gennaro et al 



201 Ij l taking into account the elongation leads to a only slightly 



lower expected velocity dispersion of 4.5^g2 km s 

Due to the tendency of massive stars to reside in the center of 
massive clusters (i.e. mass segregation) the velocity dispersion 
of the massive stars which we observe might not representative 
of the cluster as a whole ( |Fleck et al. 12006) ). To calculate the ef- 
fect of mass segregation on the difl'erence between the measured 
and expected velocity dispersion, we separate the cluster in two 
populations. The first population represents the massive post- 
main sequence stars from which our sample was drawn ^ with 
a half-mass radius of 0.44 pc and a mass of 6 X 10^ Mq ([Clark] 



5. Conclusions 

We present multi-epoch high resolution spectra of 22 massive 
post main-sequence stars in the young, massive. Galactic clus- 
ter Westerlund I. For 21 of these 22 stars the velocity variations 
have been measured between multiple epochs by cross coiTela- 
tion (see Table [T] or Fig|2]l. Many of these stars seem to show 
radial velocity variations at the one to ten km s"' level. Some 
of these variations might be explained by an underestimate of 
our uncertainties. Alternatively these variations could be caused 
either by binaries or by instabilities/waves in the complex atmo- 
spheres of these super- and hypergiants. 

Using the five least variable yellow hypergiants and the LBV, 
we find a velocity dispersion for the massive stars in Westerlund 
I of cr = 2.1^g g km s"' with a confidence limit of 95%, assum- 
ing no significant errors in measured radial velocity differences 
between the stars. In reality even these least variable stars show 
radial velocity variations between epochs of a few km s"', im- 
plying that the quoted velocity dispersion is only an upper limit 
of the true velocity dispersion. 

In Figure|5]the probability distribution of the velocity disper- 
sion from our measurements is compared to several estimates of 
the velocity dispersion which we expect in virial equilibrium. 



6 



M. Cottaar et al.: Is the massive young cluster Westerlund I bound? 



These estimates have been calculated using different estimates 
of the total mass of Westerlund I, as well as different rough ap- 
proximations of the effects of possible mass segregation and the 
effects of a velocity anisotropy, which is suggested by the ob- 
served spatial elongation of Westerlund 1. For all of these cases 
the observations favor a subvirial cluster. For the lower photo- 
metric mass estimate of |Gennaro et al.| ( |2011[ l the observations 
are consistent with the cluster in virial equilibrium. In all cases 
we exclude that Westerlund I is significantly supervirial, imply- 
ing that the cluster is bound, has survived any recent gas ex- 
pulsion. Barring any violent interactions with other clusters or 
molecular clouds, Westerlund 1, which might be the most mas- 
sive young cluster in our Galaxy, is expected to survive for bil- 
lions of years. 
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Appendix A: Velocity dispersion in an ellipsoidal 
cluster 

The density distribution of both the massive and intermediate 
mass stars in Westerlund 1 has been shown to be elongated 
( [Negueruela et al.|2010 [Gennaro et al.|20lT i. Here we will cal- 
culate the effect of this elongation on the dynamical mass calcu- 
lation of Westerlund 1, using an ellipsoidal density profile. Such 
a profile is suggested by [Gennaro et al . (201 1), who fitted a den- 
sity profile to their stellar density distribution of Westerlund 1, 
where the lines of constant densities lie along ellipses instead of 
circles. To limit the number of free parameters as well as greatly 
simplify the equations, we will assume that one of the axis of the 
ellipsoid lies along the line of sight. The other two axes are then 
the axes found by [Gennaro et al. (201 1 1 in the plane of the sky. 
We define the length of the semi-major axis found by Gennaro 
et al. (201 1 1 as fli, the length of the semi -minor axis as 02 and 
than have as the length along the line of sight, which is a 
unconstrained by observations. 

For an ellipsoidal cluster in virial equilibrium the elongation 
implies a velocity anisotropy, which is given by the tensor form 
of the virial theorem 



0, 



(A.l) 



where IKjj = M^y„crj is the tensor form of the kinetic energy 
along the axis j and Wjj is the potential energy tensor For an 
ellipsoidal system this potential can be split up in a product of 
two factors (see eq 2.144 in Binney & Tremaine [2008): (i) which 
only depends on the axial ratios of the cluster; and (ii) which is 
independent of the cluster's ellipticity or along which axis the 
potential energy tensor is calculated and only depends on the 
density dropoff along a single axis, which we take to be the ob- 
served semi-major axis. In functional form this corresponds to 



^jJ = -fjia\,a2, 03) * gip-Aong 



semi-major axis 



with 



(A.2) 



(A.3) 



where the A , are a complex func tion of the axial ratios given by 
Table 2.1 in Binney & Tremaine (2008 ). Note that the equations 
in this table assume a\ > a2> 03, which will make it necessary 
to rearrange the indices, if the axis along the line of sight is not 
the smallest. 

Because g does not depend on the 02 and 03 we can calculate 
this term for the spherical case, where we set a2 and 133 to oi. 
For a spherical cluster we have Aj - Ijli, so fj{a\,a2,aT,) - 1 
and Wjj - \W, where W is the total potential energy. Using 
the equation for the dynamical mass of a spherical cluster (e.g. 
Portegies Zwart et al.|2010| l we get 



iGM^yn 



(A.4) 



hm 



where ^ 10 ( Portegies Zwart et al. 12010)1 for the dens ity pro- 
files fitted by jBrandner et al .'(2008) and [Gennaro et al.| ( pOTT] l, 
Mjyjj is the mass of this circular cluster and is the half mass 
radius of this cluster. The dynamical mass of this artificial spher- 
ical cluster is related to the mass of the spheroidal cluster with 



M' 

dyn 
Mdyn 



1 

Cl\Cl2^7> 



(A.5) 
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Fig. A.l. The correction factor T/eiiips needed to take into account 
the spheroidal shape of the cluster plotted against the ratio of 
the length of the axis along the line of sight to the length of 
the observed semi-major axis for an observed ellipticity of 0.75 
(a2/ai ^ 0.66). The cases where the density dropoff along the 
line of sight is the same as the semi-major axis (dashed) and the 
same as the minor axis (dotted) are shown. 



Entering equations |A.3|A.5| in the equation for the ellipsoidal 
potential energy tensor gives 

Wn--l^Aj'-^. (A.6) 
2fl2fl3 r] r' 



Combined with the virial equation A. 1 this can be converted for 



the radial velocity dispersion {j - 3) as: 
2 / 

Mdyn = ?7ellips?7 '"^ (A.7) 

where the correction for the ellipsoidal structure has been com- 
pletely integrated in T/eiups, which is defined as 

2 fl2 1 

'7eiiips = ^^ — ^r- ^^■^> 

3 03 A3 

This coiTection factor only depends on the axial ratios. For an 



ellipticity in the line of sight of 0.75 (fl2/«i ~ 0.66) ( [Gennaro 
et al.|20l Tl, we plot the dependence of this correction factor on 



03 /ai in Fig. |A.l| This figure shows that there is only a small de- 
pendence of the coiTection factor on the actual length of the axis 
along the line of sight for a^/ai > 0.5. In this range an increase 
in the length of the axis, causes the total potential and thus the 
total kinetic energy to drop. However, this drop in kinetic energy 
mostly corresponds to a lowering of the velocity dispersion in 
the plane of the sky, with the radial velocity dispersion staying 
roughly constant. This interplay between the change in the total 
kinetic energy and the division of this kinetic energy over the 
three axes produces the curve shown in Fig.|A.l| 
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